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WHAT DOES THE SPECTRAL THEOREM SAY? 

P. R. HALMOS, University of Michigan 

Most students of mathematics learn quite early and most mathematicians 
remember till quite late that every Hermitian matrix (and, in particular, every 
real symmetric matrix) may be put into diagonal form. A more precise statement 
of the result is that every Hermitian matrix is unitarily equivalent to a diagonal 
one. The spectral theorem is widely and correctly regarded as the generalization 
of this assertion to operators on Hilbert space. It is unfortunate therefore that 
even the bare statement of the spectral theorem is widely regarded as somewhat 
mysterious and deep, and probably inaccessible to the nonspecialist. The pur- 
pose of this paper is to try to dispel some of the mystery. 

Probably the main reason the general operator theorem frightens most peo- 
ple is that it does not obviously include the special matrix theorem. To see the 
relation between the two, the description of the finite-dimensional situation has 
to be distorted almost beyond recognition. The result is not intuitive in any 
language; neither Stieltjes integrals with unorthodox multiplicative properties, 
nor bounded operator representations of function algebras, are in the daily tool- 
kit of every working mathematician. In contrast, the formulation of the spectral 
theorem given below uses only the relatively elementary concepts of measure 
theory. This formulation has been part of the oral tradition of Hilbert space for 
quite some time (for an explicit treatment see [6]), but it has not been called the 
spectral theorem; it usually occurs in the much deeper “multiplicity theory.” 
Since the statement uses simple concepts only, this aspect of the present formu- 
lation is an advantage, not a drawback; its effect is to make the spirit of one of 
the harder parts of the subject accessible to the student of the easier parts. 

Another reason the spectral theorem is thought to be hard is that its proof 
is hard. An assessment of difficulty is, of course, a subjective matter, but, in 
any case, there is no magic new technique in the pages that follow. It is the state- 
ment of the spectral theorem that is the main concern of the exposition, not the 
proof. The proof is essentially the same as it always was; most of the standard 
methods used to establish the spectral theorem can be adapted to the present 
formulation. 

Let 0 be a complex- valued bounded measurable function on a measure space 
X with measure /x. (All measure-theoretic statements, equations, and relations, 
e.g., “</> is bounded,” are to be interpreted in the “almost everywhere” sense.) 
An operator A is defined on the Hilbert space £ 2 (/x) by 

= cj>(x)f(x ), xE X; 

the operator A is called the multiplication induced by <£. The study of the rela- 
tion between A and <j> is an instructive exercise. It turns out, for instance, that 
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the adjoint A* of A is the multiplication induced by the complex conjugate $ 
of <f>. If also is a bounded measurable function on X> with induced multiplica- 
tion then the multiplication induced by the product function (j>\[/ is the prod- 
uct operator AB. It follows that a multiplication is always normal; it is Hermi- 
tian if and only if the function that induces it is real. (For the elementary con- 
cepts of operator theory, such as Hermitian operators, normal operators, projec- 
tions, and spectra, see [3]. For present purposes a concept is called elementary 
if it is discussed in [3] before the spectral theorem, i.e., before p. 56.) 

As a special case let X be a finite set (with n points, say), and let /z be the 
“counting measure” in X (so that [jl({x}) = 1 for each x in X). In this case 
<£ 2 (/z) is w-dimensional complex Euclidean space; it is customary and convenient 
to indicate the values of a function in <£ 2 (/z) by indices instead of parenthetical 
arguments. With this notation the action on / of the multiplication A induced 
by <j> can be described by 


A (fly • * * ) fn) — (</>l/l> y <l>nfn)» 

To say this with matrices, note that the characteristic functions of the single- 
tons in X form an orthonormal basis in «£ 2 (/x); the assertion is that the matrix 
of A with respect to that basis is diag (<j> 1 , • • • , <j> n )- 

The general notation is now established and the special role of the finite- 
dimensional situation within it is clear; everything is ready for the principal 
statement. 

Spectral Theorem. Every Hermitian operator is unitarily equivalent to a 
multiplication. 

In complete detail the theorem says that if A is a Hermitian operator on a 
Hilbert space 3C, then there exists a (real-valued) bounded measurable function 
0 on some measure space X with measure /x, and there exists an isometry U 
from £ 2 (ju) onto 3C, such that 

( U~ l AUf)(x ) = <Kx)f(x), xEX, 

for each/ in <C 2 0u). What follows is an outline of a proof of the spectral theorem, 
a brief discussion of its relation to the version involving spectral measures, and 
an illustration of its application. 

Three tools are needed for the proof of the spectral theorem. 

(1) The equality of norm and spectral radius. If the spectrum of A is A(^4), 
then the spectral radius r(A) is defined by 

r(A) = sup { | X| : X G A(^)}. 

It is always true that rG4)^||.4|| ([3, Theorem 2, p. 52]); the useful fact here 
is that if A is Hermitian, then r(A) = ||^4|| ([3, Theorem 2, p. 55]). 

(2) The Riesz representation theorem for compact sets in the line. If L is a posi- 
tive linear functional defined for all real-valued continuous functions on a com- 
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pact subset X of the real line, then there exists a unique finite measure /z on the 
Borel sets of X such that 


L(f) = f fd/x 

for all / in the domain of L. (To say that L is linear means of course that 

L(af+Pg) = aL(f) + /3L(g), 

whenever / and g are in the domain of L and a and f3 are real scalars ; to say that 
L is positive means that L(J)*z 0 whenever /is in the domain of L and/^0.) For 
a proof, see [4, Theorem D, p. 247]. 

(3) The Weierstrass approximation theorem for compact sets in the line . Each 
real-valued continuous function on a compact subset of the real line is the uni- 
form limit of polynomials. For a pleasant elementary discussion and proof see 

[1, p. 102]. 

Consider now a Hermitian operator A on a Hilbert space 3C. A vector £ in 
3C is a cyclic vector for A if the set of all vectors of the form g(/4)£, where q runs 
over polynomials with complex coefficients, is dense in 3C. Cyclic vectors may 
not exist, but an easy transfinite argument shows that 3C is always the direct 
sum of a family of subspaces, each of which reduces A, such that the restriction 
of A to each of them does have a cyclic vector. Once the spectral theorem is 
known for each such restriction, it follows easily for A itself; the measure spaces 
that serve for the direct summands of 3C have a natural direct sum, which serves 
for 3C itself. Conclusion: there is no loss of generality in assuming that A has a 
cyclic vector, say £. 

For each real polynomial p write 

up) = wax, q. 

Clearly L is a linear functional; since 

I L(p) | g \\p(A)\\ *|UII 2 = K/>(^)) -INI 2 

= sup { j X | : X6 A(*01))H|{||* 

= sup { | P(\) I : xe AGO }.||*||*, 

the functional L is bounded for polynomials. (The last step uses the spectral 
mapping theorem; cf. [3, Theorem 3, p. 55].) It follows (by the Weierstrass 
theorem) that L.has a bounded extension to all real-valued continuous functions 
on A(4). To prove that L is positive, observe first that if p is a real polynomial, 
then 

apum , & = ||^)£|| 2 ^ o . 

If /is an arbitrary positive continuous function on A(^4), then approximate y/f 
uniformly by real polynomials; the inequality just proved implies that L(/) ^0 
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(since / is then uniformly approximated by squares of real polynomials). The 
Riesz theorem now yields the existence of a finite measure /x such that 

(P(A)(, pdn 

for every real polynomial p. 

For each (possibly complex) polynomial q write 

Uq = q(A)t. 

Since A isHermitian, (g(4))*( = gC4)) is a polynomial in A, and so is (q(A))*q(A) 
( = | g| 2 (^)) ; it follows that 

J | q \*dn = (i (A)q(A)M) = ((q(A))*q(A)(, Q = || 3 (^)?|| 2 = \\Uq\\\ 

This means that the linear transformation U from a dense subset of £ 2 (/x) into 
3C is an isometry, and hence that it has a unique isometric extension that maps 
<£ 2 (ju) into 3C. The assumption that £ is a cyclic vector implies that the range of 
U is in fact dense in, and hence equal to, the entire space 3C. 

It remains only to prove that the transform of A by U is a multiplication. 
Write <£(X)=X for all X in A(^4). Given a complex polynomial q y write J(X) 
=Xg(X) =0(X)g(X); then 

U~ l AUq = U~ l Aq(A)k = U~ l q(A)t = = J. 

In other words U~ l A U agrees, on polynomials, with the multiplication induced 
by (j > , and that is enough to conclude that U~ l A U is equal to that multiplica- 
tion. This completes the outline of the proof of the spectral theorem for Her- 
mitian operators. 

The formulation of the spectral theorem given above yields fairly easily all 
the information contained in the more common versions. Thus if A is the multi- 
plication on <£ 2 (/x) induced by the real function <fi on X y and if F is a (complex) 
Borel measurable function that is bounded on A(^4), then F(A) can be defined 
as the multiplication induced by the composite function F o cj>. The mapping 
F-+F(A) is the homomorphism that is frequently known by the impressive 
name of “the functional calculus.” If, in particular, F=F M is the characteristic 
function of a Borel set M in the real line, and if E(M) is the multiplication in- 
duced by Fm o 0 , then E is the spectral measure of A . The verification that E 
is indeed a spectral measure is easy. To prove that it belongs to A (i.e., that 
A = /XJ£(X)), proceed as follows. Fix / and g in <£ 2 (ju) and write 

v{M) = (E(M)fj g) 

for each Borel set M\ it is to be proved that 

(Af, g) =f X dv(\). 
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whenever F is a simple function, and hence, by approximation, whenever F is a 
bounded Borel measurable function. This conclusion (for F(k)^z\) is just what 
was wanted. 

The multiplication version of the spectral theorem implies the spectral meas- 
ure version, but the latter is canonical (E is uniquely determined by A) whereas 
the former is not. Consider, for instance, the identity operator on a separable 
infinite-dimensional Hilbert space in the role of A. It is unitarily equivalent to 
multiplication by the constant function 1 on, say, the unit interval (with 
Lebesgue measure) ; it is also unitarily equivalent to multiplication by the con- 
stant function 1 on the set of positive integers (with the counting measure). 

There is a spectral theorem for normal operators also ; its statement can be 
obtained from the one given above by substituting “normal” for “Hermitian.” 
It is a well-known technical nuisance that the proof of the spectral theorem for 
normal operators involves some difficulties that do not arise in the Hermitian 
case. The source of the trouble is that it is not enough just to replace polynomials 
in a real variable by polynomials in a complex variable; the Weierstrass theorem 
demands the consideration of polynomials in two real variables. There is a con- 
sequent difficulty in extending the spectral mapping theorem to the kind of func- 
tions (polynomials in the real and imaginary parts of a complex variable) that 
arise in the imitation of the proof above. Even the equality of norm and spectral 
radius, while true for normal operators, requires a proof quite a bit deeper than 
in the real case. One way around all this is not to imitate the proof but to use 
the result. In [3, p. 72], for instance, the spectral theorem for normal operators 
(spectral measure version) is derived from the Hermitian theorem (spectral 
measure version) ; the only additional tool needed is an essentially classical ex- 
tension theorem for measures in the plane. 

In any case, all this talk about proof is somewhat beside the point in this 
paper. The reason a proof is outlined above is not so much to induce belief in 
the result as to clarify it. The emphasis here is not on how but on what, not on 
proof but on statement, not on How does the spectral theorem come about ? but on 
What does the spectral theorem say ? 

To see how the multiplication point of view can be used, consider the 
Fuglede commutativity theorem [2]. A possible statement is this: if A is nor- 
mal and if B is an operator that commutes with A, then B commutes with F{A) 
for each Borel measurable function bounded on A (^4). (An alternative state- 
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ment, only apparently weaker, is that if B commutes with A, then B commutes 
with A*; for a recent elegant proof see [5].) The spectral theorem shows that 
there is no loss of generality in assuming that A is the multiplication induced 
by </>, say, on a measure space X with measure jjl. If Fm is, for each Borel set M 
in the complex plane, the characteristic function of M, and if E(M ) is the multi- 
plication induced by Fm o 0, then it is sufficient to prove that B commutes with 
each E(M). (Approximate the general F by simple functions, as before.) If 
S(ikf) is the range of the projection E(M), then the desired result is that 8>(M) 
reduces B , but it is, in fact, sufficient to prove that S(M) is invariant under B . 
Reason: apply the invariance conclusion, once obtained, to the complement of 
M, and infer that both S(M) and (8(M)) J - are invariant under B. 

Observe now that S(M) is the set of all those functions in £ 2 (/x) that vanish 
outside and consider first the case of the closed unit disc, 

M = {\: | X| ^ l}; 

then 

<rw = {x: I 4>(x) I | lj. 

Assertion: S(Af) consists of all / in £ 2 (ju) for which the sequence { 1 1 Af\ \ , 
\\A% ||j 4 8 /||, • • • } is bounded. Indeed, if / vanishes outside then 

IM m /II 2 = f \r/\ *dn = f f |/|*4u. 

If, on the other hand, there is a set S of positive measure on which 0 and 
|0| >1, then 

\\A»f \\ 2 = J | rf\H,x ^ J |^|- 1/|*4* -» <o. 

The assertion is proved, and the invariance of 8>(M) under B follows: if 
\\A n f\\ ^ c for all n, then \\A n Bf\\ =\\BA n f\\ £\\B\\-\\A*f\\ g\\B\\ -c for all n. 

If M is any closed disc, | X — X 0 1 then 

= {#: | — Ao | ^ r) — | (x) | g l| . 

Since B commutes with multiplication by 0, it commutes with multiplication 
by (0— \o)/r also, and it follows from the preceding paragraph that S(itf) is 
invariant under B. 

The rest of the proof is easy measure theory ; from this point of view spectral 
measures behave even better than numerical measures. Since S(M) is invariant 
under B whenever M is a disc, the same is true whenever M is the union of 
countably many discs. This implies that S(M) is invariant under B whenever 
M is open, and hence (regularity) for arbitrary Borel sets M. 
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AN EXTENSION OF THE FERMAT THEOREM 

L. CARLITZ, Duke University 

1. If p is a prime, e ^ 1 and (a, p) = 1 then 

(1) a w see 1 (mod p e ) 
provided p e ~ 1 (p — 1) | w. It follows from (1) that 

(2) ]£ (-l) r_s () a n+8W = a n (a w ~ l) r = 0 (mod p re ) 

s=0 \S / 

for all 0. This congruence is useful for example in deriving Rummer’s con- 
gruence for the Euler and Bernoulli numbers [l, Ch. 14]. 

It may be of interest to examine the sum 

(3) A r a nk = X) (— l) r “*( r \ a^ n+sw)k , 

s=0 \S / 

where k is a fixed integer ^ 1. We remark that (3) is suggested by some recent 
work [2 ] on colored graphs. We shall prove 

Theorem 1. Let p e ~ 1 (p — 1) | w and let\^e be the largest integer such that 

(4) a w = 1 (mod p x ). 

Then 

(5) X) ( — l) r ~ s ( r ^ a {n+sw)1c s 0 (mod p^) y 

o \s / 

where 

(6) n = [(r + k - 1)/*], 
greatest integer ^(r+k — l)/k. 



